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1.  INTRODUCTION 


The  basic  mathematical  model  of  the  parametric  adjustment  expresses  each  of 
the  observables  in  terms  of  parameters,  where  the  structure  tying  together  the 
two  groups  of  variables  is  in  general  nonlinear.  The  present  study  addresses 
the  resolution  of  such  an  adjustment  problem  through  an  isomorphic  geometrical 
setup  with  tensor  structure  and  notations.  The  number  of  observables  is  denoted 
by  n  and  the  number  of  parameters  by  u,  where  n>u  for  an  adjustment  to  take 
place.  The  indices  identifying  the  elements  in  either  group  are  written  as 
superscripts,  conforming  to  tensor  notations  for  coordinates.  Thus,  the 
observables  in  the  parametric  model  are  represented  by 

x  =  x  (u  )  ,  (1) 

where  the  Roman-letter  and  the  Greek-letter  indices  vary  respectively  as 

r  =  1 ,  2 . n;  a  =  1 ,  2 . u.  ( 2a , b ) 

r  <x 

The  actual  observations  of  x  (u  )  are  in  general  inconsistent  with  the 
parameters  ua,  giving  rise  to  the  discrepancies 

,-r  r  r.  a.  , 

dx  =  x  -  x  (u  )  .  (3) 

p 

Here  and  in  the  sequel,  the  symbol  x  is  reserved  for  the  observations,  while 

r  (x  r 

the  observables  are  represented  by  x  (u  )  or  other  symbols  distinct  from  x  . 

In  a  standard  procedure,  (1)  is  subject  to  the  Taylor  expansion 

r.  «,  r  r„  r._  a,  .  a  r-,2  r  a _  B,  ,  a.  ... 

x  (u  )  =  xq  +  [3x  /3u  ]Qdu  +  (l/2)[3  x  /3u  3u JQdu  du  +  ...  ,  (4a) 

.  a  a  a  .... 

du  =  u  -  u  ,  (4b) 

o 

where  the  tensor  symbolism  implies  the  summation  convention  over  the  dummy 

(repeating)  indices.  The  subscript  "o"  corresponds  here  to  the  initial  values 

ol  r  r  ot 

of  the  parameters,  u  ,  so  that  x  sx  (u  ).  Upon  introducing  the  notation 
r 

dx  ,  such  that 

.  r  r  r  , , . 

dx  =  x  -  x  (5) 

o 

(3)  in  conjunction  with  (4a)  form  the  nonlinear  observation  equations 

dxr  -  [3xr/3uaJ  dua  +  (1/2) [32xr/3ua3u^]  duadu^  +  ...  +  dxr  .  (6) 

'o  Jo 


V 

,\* 


The  Gauss'  form  of  a  two-dimensional  surface  (u=2)  embedded  In  a  three- 
dimensional  flat  space  (n=3)  is  described,  together  with  two  other  forms,  in 
Chapter  6  of  [Hotine,  1969].  In  [Blaha,  1984],  which  treated  only  the 
linearized  parametric  adjustment,  both  the  n-dimensional  "observational"  space 
and  the  u-tlimensional  "model"  surface  were  considered  flat.  The  latter  was  thus 
in  reality  a  hyperplane.  Although  the  model  surface  is  now  intrinsically  a 
curved  space,  the  surrounding  observational  space  can  again  be  considered  flat. 
This  stems  from  the  representation  of  variance-covariance  matrices  by  associated 
metric  tensors  and  of  weight  matrices  by  metric  tensors  as  demonstrated  in 
[Blaha,  1984].  Since  the  variance-covariance  and  the  weight  matrices  do  not 
change  with  the  coordinates  ua  as  discussed  in  the  preceding  paragraph,  the 

p  s 

associated  metric  tensor  g  and  the  metric  tensor  g  for  the  observational 

sr 

space  are  constant,  at  least  insofar  as  displacements  along  the  model  surface 
are  concerned.  And  in  the  absence  of  any  conflicting  information  or 
restriction,  these  tensors  are  assumed  constant  throughout  the  observational 
space.  The  latter  is  thus  flat  as  an  admissible  and  tangible  outcome. 

In  general,  a  flat  space  can  be  described  via  Cartesian  coordinates.  In  a 
Cartesian  coordinate  system,  a  given  point  is  depicted  by  a  set  of  coordinates 

P 

x  ,  r=l,2 . n,  which  can  be  interpreted  as  its  position  vector  (p)  expressed 

by  contravariant  components,  pr=xr.  Considered  as  a  tensor,  and  thus  as  a  point 
r 

function,  p  is  associated  with  this  given  point  (and  not,  for  example,  with  the 

origin).  Accordingly,  if  the  observational -space  coordinates  were  Cartesian, 
r  r  <x 

xq2x  (uq)  would  represent  the  position  vector  associated  with  the  model - 
surface  point  called  P.  Similarly,  (1)  would  express  a  family  of  position 
vectors  associated  with  the  model  surface.  Any  of  these  position  vectors  could 
be  freely  parallel-transported  to  the  point  P  and  eventually  give  rise  to  tensor 
equations  there.  Although  derived  in  Cartesian  coordinates,  such  equations 
would  be  valid  in  any  coordinates  applicable  to  a  flat  space. 

In  multiplying  the  Cartesian  coordinates  xr=xr(u°)  by  the  appropriate 
transformation  factors,  one  obtains  the  contravariant  components  of  the  same 
family  of  position  vectors  in  another  coordinate  system.  Equation  (1)  can  thus 
be  broadly  interpreted  as  expressing  a  family  of  position  vectors,  whose 
functional  relationship  to  the  coordinates  ua  depends  in  part  on  the  above 
transformation  factors,  i.e.,  on  the  observational -space  coordinate  system  in 
use.  If  the  latter  were  completely  general,  the  position  vectors  could  not  be 


2.  GEOMETRICAL  RESOLUTION 


2. 1  Geometrical  Setup 

The  geometrical  setup  corresponding  to  the  nonlinear  adjustment  model  Is 
schematically  illustrated  in  Fig.  1.  In  accordance  with  the  discussion  in  the 
Introduction,  the  observations  are  interpreted  as  the  contravariant  components 

P 

of  the  position  vector  x  of  the  point  denoted  Q.  The  observables  consistent 

with  the  adjustment  model  correspond  to  a  family  of  position  vectors  xr(u®)  to 

r  r  a 

the  model  surface.  In  particular,  x  sx  (u  )  identifies  the  position  vector 

— r  r  ol  - 

to  the  point  P,  and  x  =x  (u  ( P ) )  identifies  the  position  vector  to  the  point 
P,  where  the  model-surface  coordinates  of  P  and  P  are  u®  and  u®(P), 
respectively.  The  difference  in  the  model-surface  coordinates  between  P  and  P 
is  denoted  by  du®,  so  that 

du®  =  u®(P)  -  u®  .  (8) 

In  the  course  of  the  development,  the  model-surface  point  P  is  considered 
variable,  and  is  restricted  to  a  small  neighborhood  of  the  fixed  point  P. 
Accordingly,  the  coordinate  differences  du®  are  variable.  As  two  special  cases, 
P  can  coincide  with  P,  or  can  coincide  with  the  point  denoted  P  in  Fig.  1. 

The  differences  in  the  observational -space  coordinates  between  P  and  P 
are  denoted  as 

r  -r  r 

Ax  =  x  -  x  (9) 

o 

which  also  describes  the  differences  in  the  contravariant  components  of  the 
position  vectors  belonging  to  two  distinct  points  in  space,  namely  P  and  P. 

In  general,  unless  these  position  vectors  are  parallel-transported  to  a  common 

r 

point,  (9)  is  not  a  tensor  equation  and  Ax  is  not  a  tensor.  However,  in  the 
special  observational -space  coordinates  with  a  constant  metric  tensor,  either  of 

these  vectors  can  be  freely  parallel-transported  to  any  location.  It  is  then 

-r  - 

convenient  to  regard  x  as  paral lei -transported  from  P  to  P,  in  which  case 

P 

(9)  is  a  tensor  equation  at  P  and  Ax  is  a  tensor  at  P.  This  will  allow 

p 

Ax  to  participate  in  other  tensor  equations  at  P.  The  geometrical  object 

P 

described  by  the  contravariant  components  Ax  can  be  symbolized  by  an  arrow 
from  P  to  P,  referred  to  as  the  vector  PP.  In  Fig.  1,  this  vector  is 
denoted  as  Ax,  and  its  tip  depicts  one  special  location  of  the  point  P. 

5 


The  differences  in  the  observational-space  coordinates  between  Q  and  P  are 
denoted  by 

dxr  =  xr  -  x£  ,  (10) 

which  has  already  appeared  as  equation  (5).  In  analogy  to  the  above,  dxr  also 
represents  the  differences  in  the  contravariant  components  of  the  position 
vectors  belonging  to  Q  and  P,  respectively.  Since  the  position  vector  belonging 

to  Q  can  again  be  freely  parallel-transported  to  P,  (10)  is  a  tensor  equation  at 

r  r 

P  and  dx  is  a  tensor  at  P.  The  geometrical  object  described  by  dx  is 

symbolized  in  Fig.  1  by  the  vector  dx.  Similarly,  the  differences  in  the 

observational-space  coordinates  between  Q  and  P  are  denoted  by 

— r  r  —  r 

dx  =  x  -  x  ,  (11) 

which  is  essentially  equation  (3).  The  position  vector  belonging  to  Q  can  be 
freely  parallel-transported  to  P,  in  which  case  (11)  is  a  tensor  equation  at 

-  -  p  -  -  p 

P  and  dx  is  a  tensor  at  P.  The  geometrical  object  described  by  dx  is 

symbolized  in  Fig.  1  by  the  vector  dx.  As  another  possibility,  both  position 

vectors  can  be  freely  parallel-transported  to  P,  in  which  case  (11)  is  a  tensor 

-  r 

equation  at  P  and  dx  is  a  tensor  at  P.  In  terms  of  Fig.  1,  the  geometrical 
object  described  by  this  tensor  can  be  Imagined  as  a  vector  of  the  same 
magnitude  and  orientation  as  the  vector  dx  already  depicted,  but  drawn  from  P. 

The  latter  possibility  is  particularly  useful.  It  results  in  three  tensor 
equations  at  P,  namely  (9),  (10),  and  (11),  yielding 

dxr  =  Axr  +  dxr  ,  (12) 

p 

which  is  again  a  tensor  equation  at  P.  Since  the  numerical  values  x  are  given 

p 

(they  represent  the  actual  observations),  and  the  numerical  values  of  x  are 

(X  p 

fixed  as  a  function  of  the  fixed  coordinates  u  ,  the  tensor  dx  is  known  from 

o 

'■  r 

(10)  as  a  part  of  the  geometrical  setup.  However,  the  numerical  values  of  x 
are  unknown  (they  represent  the  adjusted  observations),  and  so  are  the  values  of 

r  -p  p 

Ax  and  dx  .  On  the  other  hand,  Ax  can  be  expressed  as  a  function  of 

the  coordinate  differences  dua  in  (8).  Since  these  differences  are  linked  to 

-  -  p 

the  position  of  the  variable  point  P,  and  since  the  tensor  dx  is  also 
linked  to  the  position  of  P,  it  follows  that  a  variety  of  realistic  criteria 

-  p  (X 

pertaining  to  dx  could  lead  to  the  determination  of  du  and  thus  to  the 
resolution  of  the  entire  setup. 
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this  choice  appealing  geometrically  for  its  simplicity  and  objectivity,  but  it 

represents  at  the  same  time  the  desired  least-squares  {L.S.)  solution.  Since 

the  residual  vector  V  from  adjustment  calculus  corresponds  to  -dx  ,  and  the 

adjustment  weight  matrix  P  corresponds  to  g  ,  the  familiar  L.S.  quadratic  form, 
ij»  sr 

V  PV=minlmum,  corresponds  to 

~  s  -r  ~2 
dx  g  dx  =  ds  =  minimum  , 
sr 

where  ds  is  the  length  of  the  vector  dx.  But  the  minimum-length  criterion 
for  dx  means  that  this  vector  must  be  orthogonal  to  the  model  surface.  In 
accordance  with  most  practical  cases,  we  assume  that  the  small  neighborhood  of  P 
contains  only  one  model-surface  point  satisfying  the  orthogonality  requirement, 
and  that  the  latter  entails  a  minimum,  rather  than  a  maximum,  length  ds. 

A  situation  of  this  kind  is  reflected  in  the  top  portion  of  Fig.  1,  indicating 
that  one  is  concerned  with  the  point  P  and  not  with  Z. 


Tensor 


We  begin  with  the  system  of  equations  (1),  interpreted  in  the  Introduction 
as  the  Gauss'  form  of  a  u-dlmenslonal  surface  embedded  in  an  n- dimensional  flat 
space.  This  surface  has  been  called  the  model  surface,  and  the  surrounding 
space  has  been  called  the  observational  space.  With  respect  to  a  given  surface 
point  P,  an  infinitesimal  displacement  along  the  surface  can  be  described  either 
in  the  surface  coordinates  by  the  coordinate  differences  dua,  a=l,2 . u,  or  in 

p 

the  space  coordinates  by  the  coordinate  difference!  dx'  ,  r=l,2 . n.  The  two 

are  related  by  the  ordinary  formula  for  total  differentiation, 

dx '  r  =  Oxr/3ua)du°  ,  (17) 


where  the  partial  derivatives  are  evaluated  at  P. 

Although  the  surface  coordinate  system  is  in  general  curvilinear,  dua  can 
be  regarded  not  only  as  a  set  of  coordinate  differences,  but  also  as 
contravariant  components  of  a  small  vector.  Such  a  dual  role  is  explained  by  the 
statement  on  page  5  of  [Hotine,  1969]:  "Over  short  distances,  we  can, 
nevertheless,  consider  that  the  coordinate  lines  are  straight  in  a  curvilinear 
system.  By  analogy  with  the  Cartesian  definition,  we  still  can  say  that  a  small 
change  in  coordinates ...  represents  the ... contravariant  components  of  a  small 
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a 

where  Sa  is  the  Kronecker  delta,  we  can  write 
P 

.r  .  r  ...  |S  .r  „  .r  . 

=  9x  /0uH  =  s  *|3  +  J  Jp  +  •  •  ■  • 

and  thus  confirm  (14a). 

In  retracing  the  above  derivation,  we  conclude  that  the  dual  role  of  dua 

p 

(and  also  of  dx'  )  as  a  set  of  small  coordinate  differences  and  as  a  tensor  at 

the  surface  point  P  has  likewise  resulted  in  a  dual  role  of  Ar.  Specifically, 
r  ol 

as  0x  /9u  it  can  be  regarded  as  a  set  of  partial  derivatives  relating  the 

r  r 

space  and  the  surface  coordinates  at  P.  And  as  8  i  tj  j  *...  it  can  be 

a  a 

regarded  as  a  mixed  tensor  at  P,  transforming  like  a  space  tensor  in  the 
contravariant  indices  and  like  a  surface  tensor  in  the  covariant  indices. 
Except  for  the  higher-dimensionality  of  (14a),  this  equation  has  the  same  form 
as  the  tensor  equation  6.09  in  [Hotine,  1969],  although  the  latter  was  derived 
by  other  means.  The  important  property  of  (14a)  in  tensor  form  is  that  it  is 
based  on  the  space  and  the  surface  components  of  the  orthonormal  vectors 
t,  j,...  at  P,  which,  in  geometrical  interpretation,  span  the  model  plane. 
These  vectors  are  quite  arbitrary,  in  the  sense  that  any  such  set  leads  to  the 

p 

same  tensor  A^.  Due  to  its  correspondence  to  the  design  matrix  A  in 
adjustment  calculus,  which  can  be  noticed  upon  comparing  (7)  and  (7’),  A^  is 
called  the  design  tensor. 


2 . 3  Least-Squares  Soluti on 


Since  the  values  of  the  coordinate  differences  dua  are  small  by 
construction,  a  good  approximation  to  the  final  solution  dua  can  be  obtained 
through  a  linearized  model  at  P.  In  the  geometrical  context,  such  a  model 
corresponds  to  (16)  without  the  derivatives  beyond  the  first  order,  i.e.,  to 


.  r  r  of  r 

dx  =  A  du  +  dx"  (19 

a 

-  r  p 

where  the  notation  dx  has  been  replaced  by  dx"  .  In  terms  of  Fig.  1 ,  the 
vector  dx  remains  unaltered,  but  the  model  surface  is  replaced  by  the  model 

P 

plane.  If  the  adjustment  model  were  Indeed  linear,  A  would  be  constant 

a 

p 

throughout  the  model  plane  due  to  0  .50  In  (14 b).  The  mode)  surface  metric 

ap 

tensor  at  P  would  be  formed  as 


Upon  comparing  (21)  and  (22),  we  assert  that  the  L.S.  solution  (22)  of  the; 
linear  model  (19)  is  the  projection  of  dx  onto  the  model  plane.  In  Fig.  1,  the 
projected  vector  PQ  is  denoted  as  (du).  In  terms  of  the  given  quantities,  the 
projected  vector  in  covariant  components  is 


(dV  =  Aj3gsrdxr  =  ae,3  +  bj0  +  •••  ■ 


(22'  ) 


which  is  a  tensor  equation  at  P.  The  contravar lant  version  of  (22')  is  obtained 


upon  contracting  the  latter  with  a  p: 

,  ,  a.  ai 9. s  .  r 

(du  )  =  a  A^g^dx 


(22") 


This  equation  gives  the  result  identical  to  (22).  It  expresses  the  L.S. 
solution  of  the  coordinate  differences  in  the  linearized  model,  as  well  as  the 
contravar iant  components  of  the  projection  of  the  vector  dx  onto  the  model 
plane.  Since  the  L.S.  solution  of  the  linearized  model  is  assumed  to  be  dose 
to  the  final  L.S.  solution  of  the  nonlinear  model,  the  tensor  (du®)  in  (22") 
offers  a  good  first  approximation  to  the  coordinate  differences  du®  in  the 
nonlinear  model. 

In  returning  to  the  covariant  components  of  the  projected  vector,  we 
rewrite  the  tensor  equation  (22')  as 

(dfp)  =  Ap£srdxr  •  (23a) 

A  similar  tensor  equation  can  be  formed  at  the  variable  point  P,  namely 

(dUg)  =  A®ggrdxr  .  (23b) 

In  the  special  case  where  dx  is  orthogonal  to  the  model  surface,  the  projected 
vector  (du)  is  zero.  The  barred  tensors  on  the  right-hand  side  of  (23b)  can 
be  derived  from  their  unbarred  counterparts: 


K  -  *;  *  s  * 
<**;  ■  ■ 


(24a) 


(24b) 


.  r  .  r  .  r 
dx  =  dx  -  Ax 


Ax  r  -  dx ’ r  *  d ' x ' r  f 


how  far  we  can  carry  the  general  development  without  applying  a  specific 
criterion  for  the  vector  dx. 


If  the  solution  of  the  problem  at  hand  should  conform  to  the  L.S. 

criterion,  dx  must  be  orthogonal  to  the  model  surface  as  stated  earlier.  The 

variable  point  P  must  then  coincide  with  the  point  denoted  P  in  Fig.  1. 

Accordingly,  the  projection  of  dx  on  the  model  surface  at  P  is  zero,  and 

thus  (du.)  on  the  left-hand  sides  of  (23b),  (26),  and  (27)  is  replaced  by 
P 

zero.  At  this  stage,  (27)  is  contracted  with  aap ,  yielding 

du®  =  (du®)  +  aa^dA^g  dxr  -  Pad'x,r  +  ...  ,  (28) 

0  sr  r 

where 


„a  a0 .  s 
P  =  a  A„g 
r  0&sr 


(29a) 


which  is  a  tensor  at  P  that  may  be  called  the  projection  operator  (onto  the 
model  plane).  In  terms  of  this  tensor,  (22")  can  be  written  as 


.  .  a.  n«  ,  r 
(du  )  =  P  dx 
r 


(29b) 


where  (du)  is  the  projection  of  dx  onto  the  model  plane.  Of  the  other 
quantities  in  (28),  dA®  is  given  by  (24b),  dxr  is  given  by  (25a-d),  d'x,r  is 

r 

given  by  (25d)  alone,  and  a  ,  the  "inverse"  of  a.  ,  follows  from  (20a, b).  On 

r 

the  other  hand,  the  tensor  dx  (corresponding  to  the  observations  via  equation 

i*s 

5),  the  tensor  g  and  thus  its  "inverse"  ggr  (corresponding  respectively  to  the 
variance-covariance  and  the  weight  matrices  of  the  observations),  the  design 

r 

tensor  A  (corresponding  to  the  design  matrix),  and  the  sets  of  partial 

P 

derivatives  0  . .  are  known  a  priori. 

«p 

All  of  the  tensors  in  (28)  except  (du®)  contain  the  desired  coordinate 
differences  du”  between  the  point  P  representing  the  L.S.  solution  and  the 
fixed  point  P.  And  as  the  preceding  paragraph  has  demonstrated,  with  the 
exception  of  the  coordinate  differences  du®  themselves,  all  of  the  quantities  in 
(28)  are  known,  including  the  tensor  (du®)  obtained  as  the  solution  to  the 
linearized  model.  The  relation  (28)  thus  represents  u  nonlinear  equations  in  u 
unknowns.  Since  the  solution  (du®)  is  a  good  initial  approximation  to  the 
desired  values  dua,  all  of  the  sets  du®  on  the  right-hand  side  could  be  replaced 
by  (du®)  in  what  can  be  termed  the  first  geometrical  iteration.  Subsequently, 
the  improved  set  du®  computed  as  the  left-hand  side  of  (28)  can  be  substituted 


straightforward  fashion  as  indicated  by  the  dots.  However,  this  is  not  foreseen 
as  being  necessary  or  desirable  in  practice,  where  the  coordinates  u*  are 
usually  close  to  their  final  values  and  the  terms  containing  higher  than  second 
powers  of  dua  are  completely  negligible.  The  terms  developed  explicitly  in  this 
study  already  go  one  order  of  refinement  beyond  the  usual  linearization. 

In  transcribing  the  complete  iterative  procedure  in  matrix  notations 

consistent  with  the  above  specifications,  we  begin  by  grouping  the  observations 

in  the  (column)  vector  x,  and  grouping  the  observables  associated  with  the 

coordinates  ua  in  the  (column)  vector  x  .  This  leads  to  the  formation  of  the 
o  o 

vector  dx  according  to  (5),  namely 

dx  =  x  -  xq  ,  (30a) 

where  each  vector  has  the  dimensions  (nxi).  The  variance  covariance  matrix  of 
observations  is  g,  and  their  weight  matrix  is 

g*  «  g  1  .  (30b) 

where  either  matrix  has  the  dimensions  (nxn).  The  design  matrix  A  of  dimensions 

P 

(nxu)  corresponds  to  in  (14a): 

A  h  [dxr/au®]  ,  (30c) 

where  the  brackets  indicate  grouping  of  elements,  the  contravariant  indices 
identifying  the  rows  and  the  covariant  indices  identifying  the  columns.  The 
partial  derivatives  of  the  elements  in  A  are  grouped  in  a  three-dimensional 
array  {3A/3u},  such  that 

(3A/3u)  =  [3A/9U1 ] , [3A/0u2l . [3A/3uU]  ,  (30d) 

where  each  pair  of  brackets  defines  a  matrix  of  dimensions  (nxu),  and  where 

these  u  matrices  are  separated  from  each  other  by  commas.  The  matrices  in 

(30c, d)  are  evaluated  with  the  set  ua=ua.  All  of  the  quantities  in  (30a  d) 

o 

are  considered  to  be  known  a  priori. 


The  starting  values  for  the  final  L.S.  solution  du  are  obtained  from  the 

linearized  model  associated  with  the  above  set  ua  u°\  In  particular,  one 

o 

computes  successively: 
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With  all  the  quantities  for  the  right-hand  side  of  (28)  available  and 
expressed  in  matrix  notations,  this  equation  is  transcribed  as 

du  =  (du)  *■  adA^g*dx  -  Pd'x'  *  ...  (33) 

The  new  du  above  is  used  for  the  right-hand  side  in  the  next  iteration,  which 
begins  with  the  computation  of  the  updated  values  in  (32a-e).  In  the  first 
geometrical  iteration,  the  "new"  du  is  (du).  When  the  elements  of  du  no  longer 
change,  or  change  by  negligible  amounts,  the  iterative  process  is  terminated. 


2 . 4  Complete  Resolution 


Upon  completion  of  the  iterative  process,  the  final  coordinate  values 
describing  the  desired  L.S.  point  P  are 

u  (P)  =  u  +  du  ,  (34) 

which  follows  from  (8).  The  geometrical  iterations  also  yield,  as  a  by  product, 
-  r 

the  values  of  dx  corresponding  to  the  minus  residuals: 

dxr  =  dxr  -  Axr  ,  (35a) 


where 

Axr  =  dx'r  +  d'x'r  +  ...  (35b) 

r  r 

Equations  (35a, b)  as  well  as  the  expressions  for  dx'  and  d'x'  have  been 

-  f 

presented  in  (25a  d).  The  position  vector  x  of  t ho  L.S.  point  P  follows 
from  (9)  and  (11)  as 

~r  r  ,  r  r  ,  r  .  _  _ , 

x=x+Ax=x-dx.  (36) 

o 

-  p 

The  values  of  x  correspond  to  the  adjusted  observations. 

In  addition  to  the  parametric  solution,  the  L.S.  adjustment  process  should 
also  yield  the  variance  covariance  matrices  of  the  adjusted  parameters  and 
observations,  and  possibly  of  linear  functions  of  the  adjusted  parameters.  To 
this  we  shall  add  the  weight  matrices  of  the  adjusted  parameters  and 
observations.  Clearly,  all  these  matrices  could  be  computed  by  forming  the 
matrix  of  normal  equations  corresponding  to  the  point  P  and  inverting  it. 
However,  in  a  consistent  effort  to  avoid  any  inversion  beyond  the  solution  of 
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In  expanding  a  in  the  Taylor  series,  we  take  advantage  of  the  result  just 
derived  and  of  the  basic  equation  (20b) ,  rewritten  as 


az  ^  « 

a  a  =  <5 
re  e 


Upon  differentiating  the  latter  with  respect  to  u^,  the  right-hand  side  becomes 
zero.  The  contraction  of  the  resulting  equation  with  a£^  produces 

aa^/3ur  •  -aa“oa  /»'')«**  . 

and  thus 

daa^  =  (3aa^/3u^)  du7^  =  -aa^da  a£^  , 

which  is  symmetric  in  the  indices  ct  and  0,  as  is  confirmed  upon  using  the 
symmetry  in  da  .  If  we  differentiate  3aa^/3u^  with  respect  to  u^,  and 
contract  the  new  equation  with  du'du  ,  we  obtain 

d’aa^  =  ( 1/2)  (32aa^/3u^3u^)  du^du*5 

=  -a^d’a^a6^  -  (1/2)  (S°^  +  S^“)  , 


where 


s’“  -  a'"' da  dae(J  -  s*” 


The  second  equality  above  is  the  consequence  of  the  formula  giving  da  p,  and  of 
the  symmetry  in  a*^  as  well  as  in  da 

pe 

In  collecting  the  results,  we  write  the  final  outcome  as 


-  a“3  ♦  Aaa^ 


a/3  a&  ,  a/3 
*=daN+d'a'+ 


.“"da  as3 


d'a”*  .  -a“"d'a  aEfJ  -  Saf 


(38a) 


_ai9  ap.  .  e/9 
S  =  a  da  da 


( 3Be ) 


all  of  which  are  symmetric  in  a  and  /9 .  Upon  considering  (37b),  equations 
(38b  e)  could  be  replaced  by  a  more  concise  expression: 


(43) 


a"  =  g  -  o'  =  a  -  o' 

sr  sr  Bsr  6sr  *sr  ’ 

where  the  second  equality  is  again  due  to  the  constant  ggr,  and  ggr  is  given 
above  by  (42).  As  a  natter  of  interest,  one  can  form  a  quantity  similar  to 

-p 

A  in  (39),  and  associate  it  with  linear,  or  linearized,  functions  of  the 
adjusted  parameters.  In  this  case,  all  the  quantities  in  (39)  except  a  p 
would  be  attributed  the  symbol  The  resulting  tensor  at  P,  called  the 
necessary  associated  metric  tensor  in  the  functional  space,  would  represent  the 
variance-covariance  Matrix  of  these  functions.  However,  the  weight  matrix  of 
such  functions  could  not  be  expressed  in  analogy  to  (42)  because  the  metric 
tensor  in  the  functional  space  is  unknown. 

The  key  formulas  derived  in  this  section  will  now  be  transcribed  in  matrix 
notations  in  accordance  with  the  convention  introduced  earlier.  In  grouping  the 
u  adjusted  parameters  in  the  (column)  vector  u,  one  can  transcribe  (34)  as 

u  =  u  +■  du  .  (44) 

o 

The  minus  residual  vector  follows  from  (35a, b)  as 

dx  =  dx  -  Ax  ,  (45a) 


where 


Ax  =  dx'  +  d'x'  + 


(45b) 


Equations  (45a, b)  as  well  as  the  expressions  for  dx'  and  d'x'  have  already 
appeared  in  (32b-e).  With  regard  to  the  adjusted  observations,  (36)  gives 

x=x  +  Ax  =  x  -  dx  .  (46) 


The  weight  matrix  of  the  adjusted  parameters,  a*,  follows  from  (37a  e): 


a*  =  a*  +  Aa*  , 

(47a) 

Aa*  =  da*  ♦  d'a*  , 

(47b) 

da*  =  N  ♦  NT  , 

(47c) 

N  =  ATg*dA  , 

<47d) 

d’a*  =  dATg*dA  . 

( 47e ) 
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Finally,  the  weight  aatrix  of  the  adjusted  observations  is  transcribed  from 
(42)  as 

g*'  =  g*g'g*  .  (52) 

where  g'  is  given  by  (49).  Por  the  weight  Matrix  of  the  residuals,  froa  (43) 
we  have 

g*"  -  g*  -  g*’  .  (53) 

with  g*'  given  above  by  (52).  The  weight  aatrix  of  functions  of  the  adjusted 
paraaeters  cannot  be  coaputed  in  analogy  to  (52)  because  the  weight  aatrix  in 
the  ’’functional  space”  is  unknown. 

As  a  aatter  of  interest,  we  present  a  foraula  serving  for  the  verification 
of  the  final  L.S.  outcoae.  This  relation  has  been  encountered  in  tensor  fora  as 
a  by-product  of  the  geoaetrlcal  development  in  Section  2.3.  In  particular, 
equation  (23b)  and  the  stateaent  Immediately  following  it  have  established  that 
if  the  vector  dx  is  orthogonal  to  the  aodel  surface,  the  vector  (du)  is 
zero.  But  this  occurs  precisely  when  the  variable  point  P  mentioned  prior  to 
(23b)  becomes  the  desired  L.S.  point  P.  Accordingly,  upon  the  convergence  of 
the  L.S.  process,  it  aust  hold  true  that 

ATg*dx  =  0  , 

which  is  the  matrix  form  of  (23b)  with  (duo)=0. 

P 
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dii  implicated  in  the  second,  third,  and  further  terms  on  the  right-hand  side  is 
(du).  The  quantities  computed  with  this  vector  will  be  symbolized  here  by 
parentheses  as  well.  Thus,  we  write 

du(1)  =  (du)  +  a(dAT)g*(dx)  -  aATg*(d'x')  +  ...  ,  (57) 

where 


(dA)  =  (3A/3u) (du)  ,  (58a) 
(dx)  =  dx  -  (dx1)  -  (d 'x ' )  -  ...  ,  (58b) 
(dx1)  =  A (du)  ,  (58c) 
(d ' x ' )  =  (dA) (du)/2  .  ( 58d ) 


Equation  (58a)  stems  from  (32a),  and  equations  (58b-d)  stem  from  (32b-e),  with 
the  parentheses  introduced  in  accordance  with  the  above  convention. 


We  now  turn  our  attention  to  the  first  adjustment  iteration  and  compare  the 
result  with  that  produced  by  the  first  geometrical  iteration.  In  analogy  to 
(54).  the  outcome  of  the  first  adjustment  iteration  is  represented  by 


■  *(0)*(0)'!‘<lx(0) 


where  the  subscript  (0)  on  the  right-hand  side  indicates  that  thi  pertinent 
quantities  are  associated  with  the  point  P^  generated  in  the  zero  th  iteration. 
To  establish  a  theoretical  link  with  the  geometrical  approach,  we  express  these 
quantities  in  terms  of  their  counterparts  at  P  via  the  Taylor  series  expansion, 
using  the  coordinate  differences  (du)  relating  PQ  to  P: 


a(0)  =  a  +  lda>  +  • •  •  (60a) 

A(0)  =  A  ♦  (dA)  *  . . .  .  (60b) 

dXjQj  -  dx  -  (dx1)  ( d  *  x ' )  -  ...  =  (dx)  .  (60  c ) 

The  parentheses  serve  the  same  purpose  as  in  the  previous  paragraph.  The  first 


equality  in  (60c)  is  essentially  (25a, b),  where  the  variable  point  P  is  taken 
as  the  current  point  P  .  The  second  equality  in  (60c)  stems  from  (58b). 
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3.2  Second  and  Further  Iterations 


We  now  return  to  the  formula  (56)  and  apply  It  to  the  second  geometrical 
iteration.  In  this  case,  du  on  the  left-hand  side  is  written  as  du.  .  ,  while  du 
on  the  right-hand  side  is  du^.  However,  due  to  (64),  du  on  the  right-hand 
side  is  replaced  by 

(du)  +  (du){1)  =  [du]  .  (66) 

Upon  symbolizing  the  quantities  computed  with  [du]  by  brackets,  in  analogy  to 


(57)  and  (58a-d)  we  have 

dUj 2 j  =  (du)  +  a[dAT]g*(dx]  -  aATg*[d’x’]  +  ...  ,  (67) 

where 

[dA]  =  (0A/3u) [du]  ,  (68a) 

[dx]  =  dx  -  [dx']  [ d ' x ' ]  -  ...  ,  (68b) 

[dx ' ]  =  A [du]  ,  (68c) 

[ d 1 x 1 ]  =  [dA] [du]/2  .  ( 68d ) 


With  regard  to  the  second  adjustment  iteration,  in  analogy  to  (59)  we  write 


,du)u>  *  a(uAme“dx(i)  ■ 


where  the  subscript  (1)  indicates  the  quantities  associated  with  the  point 
generated  in  the  first  iteration.  Again,  these  quantities  are  expressed  via  the 
Taylor  series  expansion  at  P,  this  time  upon  using  the  coordinate  differences 
(du)  *■  (du)  j  j  j  relating  to  P  according  to  (65),  i.e.,  using  [du]  from  (66): 

a^j j  =  a  +  [da]  +  . . .  ,  (70a) 

A(1 )  -  A  +  [dA]  +  . . .  ,  (70b) 

dx^jj  =  dx  -  [dx']  -  [d'x']  -  ...  =  [dx]  .  (70c) 

The  first  equality  in  (70c)  again  corresponds  to  (25a, b),  this  time  with  the 
variable  point  P  taken  as  P^,  while  the  second  equality  stems  from  (68b). 
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The  second  adjustment  iteration  generates  the  point  P  .  whose  model  -surface 
coordinates  are 

u(P2)  =  uiPj)  +  (du)(2)  . 

or,  upon  considering  (65), 

u(P2)  u(P)  ♦  (du)  4  (du)(1)  +  (du)(2)  •  (75) 

On  the  other  hand,  the  second  geometrical  iteration  gives  rise  to  the  point 
whose  model -surface  coordinates  are  u(P)+du  .  However,  due  to  (74')  we  have 

V  ^  * 

»(P)  *  dll{2)  -  U(P)  *  (du)  •  '  (du)  - 

revealing  that  the  second  geometrical  and  adjustment  iterations  lead  essentially 
to  the  same  point  P 

In  proceeding  in  the  same  fashion,  we  would  find  for  the  third  iteration: 

[I  *  (da}a*J  ( (du)  }  ^  (du)^]  +  (du)(3)  ~  du(3)  (du)  '  (76) 

Here  {da}  is  a  function  of  {du},  such  that 

(du)  +  ( du ) ( i )  +  (du,(2)  {du*  1 


which  represents  the  coordinate  differences  between  P2  and  P.  Again,  if  the 

magnitude  of  (du)  drops  significantly  visa  vis  (du)  ,  the  iterative 
i  ^  J  [  £  ) 

process  has  already  converged.  Otherwise  the  terms  with  (da}  on  the  left  hand 
side  of  (76)  can  be  disregarded,  yielding 

(du)(1)  •  <du)j2)  f  (du)^3j  -  du^3)  (du)  .  (76') 

The  third  adjustment  iteration  generates  the  point  P  .  whose  model  surface 
coordinates  are 

u(P3)  -  u(P)  ♦  (du)  +  (du)(1)  *  (du)(2)  4  ldu)(3)  •  (77) 

On  the  other  hand,  the  third  geometrical  iteration  gives  rise  to  the  point  with 
the  model  surface  coordinates 

u(p)  4  du(3)  ~  11  (p)  '  (du)  •  ( du )  ^  j  j  *  ( du )  j  2  j  •  (du){3)  , 

where  (76')  has  been  taken  into  account.  But  this  means  that  the  third 
geometrical  and  adjustment  iterations  also  lead  essentially  to  the  same  point 
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4.  CONCLUSIONS 


The  isomorphism  between  a  nonlinear  adjustment  model  with  n  observations 
and  u  parameters  on  one  hand,  and  a  u-dimensional  model  surface  embedded  in  an 
n  dimensional  (flat)  observational  space  on  the  other,  has  been  described  in  the 
Introduction.  To  avoid  unnecessary  repetitions,  it  is  only  mentioned  that  in 
the  geometrical  context,  the  observations  correspond  to  the  observational  space 
coordinates  of  the  point  called  Q  and  the  parameters  correspond  to  the  model 
surface  coordinates  of  the  point  called  P.  The  latter  can  be  perceived  as  a 
result  of  mapping  of  Q  onto  the  model  surface  according  to  a  certain  criterion. 

The  isomorphic  geometrical  setup  has  been  developed  in  Chapter  2  by  means 
of  tensor  structure  and  notations.  Crucial  to  this  development  are  three 
geometrical  notions: 

rs 

1)  The  fundamental  tensors  g  and  ggr  are  constant  throughout  the 
observational  space.  In  the  adjustment  context,  these  tensors  correspond 
respectively  to  the  variance- covariance  and  the  weight  matrices  of  observations. 

a  r 

2)  The  quantities  such  as  du  and  dx  are  regarded  not  only  as  sets  of 
model -surface  and  observational-space  coordinate  differences,  respectively,  but 
also  as  contravariant  components  of  small  vectors  (geometrical  objects).  In  the 
adjustment  context,  the  components  dua  correspond  to  the  adjusted  parameters 
referring  to  the  "Initial  point"  P,  where  the  model  is  expanded  in  the  Taylor 

JP 

series,  and  the  components  dx  correspond  to  the  observations  referring  to  P. 

3)  The  simplest  and  the  most  objective  mapping  of  Q  onto  the  model  surface 
is  the  one  where  the  distance  from  P  to  Q  is  a  minimum,  in  which  case  the 

small  vector  PQ,  denoted  dx,  is  orthogonal  to  the  model  surface.  Since,  in 

the  adjustment  context,  the  length  square  of  dx  corresponds  to  the  standard 
T 

quadratic  form  V  PV,  the  orthogonality  condition  characterizes  the  least  squares 
(L.S.)  criterion,  and,  as  such,  is  adhered  to  throughout  this  study. 

The  primary  objective  in  the  geometrical  resolution  of  a  nonlinear  model  is 
the  determination  of  du°,  the  coordinate  differences  between  the  L.S.  point  P 
and  the  initial  point  P.  An  essential  stepping  stone  in  this  task,  accomplished 
in  Chapter  2,  is  the  determination  of  (chi'*)  which  represents  the  L.S.  solution 
of  the  corresponding  linearized  model.  in  the  latter,  the  role  of  the  model 
surface  Is  substituted  for  by  the  model  plane,  i.e..  a  u  dimensional  plane 


such  vectors  are  replaced  by  the  new  result,  etc.  However,  the  term  (du)  on  the 
right  hand  side  is  fixed  throughout.  When  the  results  no  longer  change,  or 
change  by  insignificant  amounts,  the  convergence  has  been  achieved. 

In  using  the  above  algorithm,  one  can  compute  the  value  of  the  i, .S. 

quadratic  form  in  every  iteration,  corresponding  to  the  length  square  of  the 

-T 

vector  PQ.  The  latter  is  given  in  matrix  notations  as  dx  g*dx.  This 
computation  is  virtually  effortless  because  the  product  g*dx  must  be  evaluated 
in  any  event  as  a  part  of  the  iterative  process.  The  knowledge  of  the  L.S. 
quadratic  form  at  every  step  is  especially  useful  if  the  possibility  exists  that 
the  solution  could  converge  to  a  maximum  rather  than  to  a  minimum.  Since  the 
current  algorithm  is  based  on  the  orthogonality  condition,  it  is  unable  to 
distinguish  between  the  two  extremes  in  the  case  where  both  occur  within  a  small 
neighborhood  of  P.  The  computation  of  the  L.S.  quadratic  form  can  quickly 
discern  such  a  difficulty,  which  can  then  be  rectified  upon  choosing  a  different 
initial  point  P. 

A  comparison  between  the  new  "geometrical  iterations"  and  the  standard 
"adjustment  iterations"  has  been  carried  out  in  Chapter  3  and  illustrated  in  the 
Appendix.  It  has  been  shown  that  the  two  processes  converge  at  almost  exactly 
the  same  pace,  generating  nearly  identical  model  surface  points  at  every  step 
beyond  the  common  zeroth  iteration.  This  similarity  in  outcome  occurs  in  spite 
of  a  significant  operational  disparity.  Whereas  each  adjustment  iteration 
involves  a  numerical  reformulation  of  the  linearized  model,  including  the 
inversion  of  the  matrix  of  normal  equations,  the  geometrical  iterations  keep  the 
quantities  which  are  independent  of  the  solution  fixed.  The  benefit  of  the 
geometrical  approach  thus  becomes  clearer.  Unlike  the  standard  procedure,  this 
approach  does  not  benefit  from  a  fresh,  albeit  expensive,  start  at  every  step, 
yet  it  produces  the  same  result  in  the  same  number  of  iterations. 


The  basic  adjustment  setup  consists  of  dx ,  g* ,  A,  and  {9A/du},  defined 
in  (30a-d).  The  values  in  dx  follow  from  (A. 4)  and  (A. 6),  namely 

-0.19 
-1.07 
0.2275 

According  to  an  earlier  statement,  g  and  g*  are  given  by 

g  =  g*  =  I  •  (A. 8) 

Upon  substituting  the  values  from  (A. 5)  into  (A. 2),  the  design  matrix  associated 
with  P  becomes 


A  *  |  0  1  (A. 9) 

[_1.98  10.72 

The  array  {8A/3u}  is  constant,  and  has  already  been  presented  in  (A. 3a  c). 
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If  the  aodel -surface  coordinates  (A. 11)  are  utilized  also  in  (A. 2),  the  design 
Matrix  becoMes 


A (new) 


1.9966618  10.8028834 


(A. 13) 


In  (A. 12)  and  (A. 13),  the  symbol  (new)  is  used  in  lieu  of  (0)  as  it  appeared 
e.g.  in  equation  (59). 

In  paralleling  (A.IOa-c),  one  computes  the  pertinent  matrices  in  the  first 
adjustment  iteration: 


a*(new)  = 


4.9866584  21.5697048 

21.5697048  117.7022895 


a (new)  = 


0.9672389  -0.1772528 

0.1772528  0.0409787 


P(new)  = 


0.9672389  -0.1772528  0.0164079 

-0.1772528  0.0409787  0.0887746 


The  solution  vector  in  this  iteration  is  denoted  (du)^  in  agreement  with  the 
notation  in  Chapter  3.  Similar  to  (A.IOd),  but  with  P(new)  and  dx(new) 
replacing  P  and  dx ,  this  vector  is  computed  as 


r0. 00013541 

i 

L0 . 0000006J 


(A. 14) 


The  updated  point  P  in  the  first  adjustment  iteration  is  denoted  P ^ ,  again  in 
agreement  with  Chapter  3.  Its  model -surface  coordinates  are  obtained  by  adding 


(A. 14)  to  (A. 11): 


1 . 1996260 
| 1 .300071 lj 


The  second  adjustment  iteration  proceeds  in  analogy  to  the  preceding  two 
paragraphs,  except  that  it  is  based  on  (A. 15)  instead  of  (A. 11).  The  final 
results  paralleling  (A. 14)  and  (A. 15)  are 
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where  the  array  {3A/0u},  the  matrix  A,  and  the  vector  dx,  presented 
respectively  in  (A.3a-c),  (A. 9),  and  (A. 7),  are  fixed.  Since  the  current 
adjustment  model  is  quadratic,  the  dots  in  (A. 20)  and  (A.21d)  can  be  discarded. 


In  the  first  geometrical  iteration,  du  in  (A.21a-c)  is  replaced  by  (du) 
from  (A.IOd).  The  indicated  operations  result  in 


0.0166618  0.0828834 


0.0094906 


dx'  =  i  0.0100705  ,  d'x'  = 


L0. 1267471 


0 


0.0004964 


j  0.1994906 
dx  -  -1.0800705 


0. 1002565 


The  utilization  of  the  new  quantities  dA,  d'x',  and  dx  in  (A. 20)  yields  the 
solution  in  the  first  geometrical  iteration: 


To. 0096265 
du  =  ! 

I1)  [0.0100714 


(A. 22) 


This  result  gives  rise  to  a  point  described  by  the  model  surface  coordin  tes 


1 . 1996265 

«  ^  dum  = 

°  l 1  *  1.3000714 


(A. 23) 


where  uq  has  been  listed  in  (A. 5).  Upon  consulting  (A. 15),  the  new  point  is 
seen  to  be  nearly  identical  to  P  generated  in  the  first  adjustment  iteration. 


The  second  geometrical  iteration  follows  the  procedure  just  described, 


except  that  du  in  (A.21a-c)  is  now  replaced  by  du 


The  solution  is 


0.0097142 

0.0100544 


(a. 24 : 


which,  when  added  to  uq  in  (A. 5),  gives  rise  to  another  point,  whose  model 
surface  coordinates  are 


"o  *  duU) 


1  . 1997142 
1 .3000544 


(A. 25) 


A . 4  Adjustment  Results  and  Verifications 
In  using  (A. 28)  in  (A.21a-e),  one  obtains 


dA 


0  0 

0  0 

0.02  0.08 


(A. 30) 


0.01 

0 

1 

r- 0.201 

II 

X 

T5 

0.01 

,  d '  x '  = 

0 

1 

dx  = 

00 

o 

0.127 

0.0005 

0. 10J 

The  column  vector  dx  represents  the  minus  residuals.  The  final  L.S.  quadratic 
form  is 


dx  g*dx 


1.2164  . 


Although  not  listed,  this  quadratic  form  has  been  computed  throughout  the 
iterative  process,  and  has  been  noted  to  converge  to  its  minimum  value  presented 
above.  The  adjusted  parameters  are  given  by  (A. 29)  as 


u 


1.20 

1.30 


while  the  adjusted  observations  follow  from  (46)  as 


(A. 31 ) 


x 


1.20 

1.30 

8.22 


(A. 32) 


Equation  (A. 32)  is  confirmed  by  the  direct  evaluation  of  (A.la-c)  with  the 
1  2 

values  of  u  and  u  from  (A. 31). 


In  adding  dA  from  (A. 30)  to  A  given  by  (A. 9),  one  obtains  the  design  matrix 
at  the  L.S.  point  P: 


A 


—  — 
1  0 

0  1  j 

2  10. 8 J 


(A. 33) 


This  outcome  is  also  established  directly  upon  utilizing  the?  values  of  u1  and  u2 
from  (A. 31)  in  (A. 2).  A  useful  verification  is  offered  by 
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